Promoters having a modified transcription efficiency and derived from methyltrophic yeast

ABSTRACT

The present invention relates to a deoxyribonucleic acid (DNA) comprising at least one promoter sequence, which is derived from a wild-type promoter of a methyltrophic yeast, whose transcription efficiency is modulated in comparison to the efficiency of the wild-type promoter by inserting or modifying a DNA binding site. The invention also relates to host cells, expression vectors, kits and methods for producing proteins while using the inventive DNA, as well as to different uses of the same and to a method for producing expression vectors.

BACKGROUND OF THE INVENTION

The present invention concerns a deoxyribonucleic acid (DNA) comprising at least one promoter sequence, which is derived from a wild type promoter of a methylotrophic yeast, whose transcription efficiency is modulated compared to the efficiency of the wild type promoter. Further, the present invention concerns host cells, expression vectors, kits and processes for the production of proteins with the use of the DNA according to the invention and various applications thereof and a process for the production of expression vectors.

Because of their capability for mammal-like co- and posttranslational modification of proteins, fungi and in particular yeasts are attractive systems for the production of recombinant proteins. For the production of recombinant proteins, the coding sequence of a gene for a protein of interest is often expressed under the control of a suitable heterologous promoter. The inducible promoters from methylotrophic yeasts have proved especially advantageous here.

Methylotrophic yeasts include for example the genera Candida, Hansenula, Pichia and Torulopsis. Their promoters are characterised by unusually strong transcription induction and good and simple controllability. Particularly advantageous promoters are the promoters responsible in vivo for the regulation of the methanol metabolism. Methylotrophic yeasts catalyse the oxidation of methanol to carbon dioxide via the intermediate stages of formaldehyde and formate, these reactions being catalysed respectively by an alcohol or methanol oxidase (Aoxp or Moxp), a formaldehyde dehydrogenase (Fmdp) and a formate dehydrogenase (Fmdhp). The hydrogen peroxide arising in the first step is degraded by catalase. The C1 compound is assimilated via the transketolase reaction of xylulose-5(P) with formaldehyde. This reaction is catalysed by dihydroxyacetone synthase (Dhasp). The enzymes Aoxp or Moxp, Fmdhp and Dhasp expressed under the control of the AOX or MOX, FMD and DHAS promoters represent up to 40% of the total cell protein in methanol-induced cells. These promoters are thus also described in specialist circles as superpromoters. Most of the promoters for the said genes are repressible with glucose and are therefore very controllable individually.

A further very strong promoter from methylotrophic yeasts is the TPS1 promoter, which controls the expression of the trehalose-6-phosphate synthase gene and is heat-inducible. Tps1p catalyses the conversion of glucose-6-phosphate (GLU6P) and UDP-glucose (UDPG) to trehalose-6-phosphate and UDP. During a one-hour thermal shock from 27° C. to 40° C., treated yeast cells accumulate trehalose and in this way build up increased heat tolerance.

In addition, methylotrophic yeasts have unusually strong constitutive promoters. Two of these very strong constitutive promoters are GAPDH (glyceraldehyde phosphate dehydrogenase) and PMA1 (plasma membrane-bound H⁺-ATPase 1).

In the recombinant preparation of heterologous proteins, both the activity of the promoter sequence, its regulation and also its integration into the genome of a suitable host cell determine the economic viability and efficiency of the process. In general, especially high transcription activity is desirable. In many cases, however, excessively high transcription activity can lead to the host cell being damaged or dying, before all heterologous gene products of the co- and post-translational modification necessary for the activity could be introduced. In particular, the quantity of toxic proteins must be controlled in such a way that the toxic action is minimised until the optimal yield of transcribed and post-translationally modified proteins. Such control is also difficult to effect with the known promoters inducible with methanol.

BRIEF DESCRIPTION OF THE DRAWINGS

The figures and examples illustrate the invention. The invention is illustrated in more detail below with reference to the figures, which show the following:

FIGS. 1A and 1B show the expression efficiency for strains with the reporter genes lacZ (FIG. 1A) and phytase (FIG. 1B) under the control of the FMD promoter variants (FMD id1 and FMD id1-4) according to the invention compared graphically with the wild type promoter (FMD wt, strain with wild type promoter) after 24 hours' growth on glucose (light grey) or glycerine (dark grey)).

Admittedly the value for the FMD id1 promoter after glycerine induction is relatively low both with lacZ and also with phytase as reporter gene, however it is very high in absolute terms (compared to the wt FMD promoter) and moreover is about as high as under glucose conditions. This indicates that FMD id1 is a constitutive promoter with regard to the carbon sources glycerine and glucose. On glucose, FMD id1-4 shows similar values to those for the wt FMD Promoter, however on glycerine it shows markedly higher values. Hence both FMD id1 and also FMD id1-4 show modified regulation and strength compared to the wt FMD promoter.

FIG. 2 shows the regulation factors (the quotient of the measured activity values on glycerine and glucose) of the different FMD promoter variants id1 and id1-4 with lacZ (dark grey) or phytase (light grey) as reporter gene compared to the regulation factor of the wild type FMD promoter (FMD wt). A quotient of 1 means that the promoter is constitutive with regard to the carbon sources studied. Values >1 indicate the extent of glycerine derepression. Thus, compared with the wt FMD promoter, the FMD id1 promoter is constitutive, while the FMD id1-4 promoter is more strongly derepressible.

FIG. 3 shows an X-Gal overlay test with strains which had previously been grown at 37° C. on YNB/glucose plates. Both with the MOXplasmid pC11 MOX idAC1 and also with the TPS1 promoter, it was found that the palindromisation of the first (nearest to the TATA box) ATF/CREB sequence-like motif under the tested condition resulted in a visible increase in the promoter activity, compared with the respective wild type promoter (plasmid pC11 MOX wt, plasmid pC11 TPS wt).

MOX Promoter:

-   -   Plasmid pC11 MOX idAC1: natural (cryptic) AC1 sequence         (atggtgtcat) idealised by two exchanges to atgacgtcat (ATF/CREB         binding site). The altered bases are underlined (also in the         following examples).     -   Plasmid pC11 MOX idY1: natural (cryptic) Y1 sequence         (cgaatgtaat) idealised by four exchanges to attacgtaat (YAP         binding site).         TPS1 Promoter:     -   Plasmid pC11 TPS idAC1: natural (cryptic) AC1 sequence         (atggtgagat) idealised by four exchanges to atgacgtcat (ATF/CREB         binding site).     -   Plasmid pC11 TPS idC1: natural (cryptic) C1-Sequenz (attgaccaat)         idealised by two exchanges to attgcgcaat (C/EBP binding site)]

FIGS. 4 to 6: In FIGS. 4 to 6, the TATA box and individual cryptic binding sites are represented as follows.

-   -   Double underlined: presumed TATA BOX.     -   Framed with continuous line: possible cryptic ATF/CREB binding         sites for bZip proteins.     -   Framed with broken line: possible cryptic YAP binding sites for         bZip proteins.     -   (AC=cryptic ATF/CREB binding site; Y=cryptic YAP binding site;         C=cryptic C/EBP binding site).—The bases with homology to the         consensus sequence are underlined.     -   The numbering is based on the translation start. All the         sequences quoted end with guanine. This guanine is in each case         the first base of an EcoR1 cleavage site gaattc. Each start         codon follows directly thereafter. Hence, based on the atg, the         said guanine has the number (−6).

FIG. 4 shows the sequence of the wild type FMD promoter with presumed TATA box and possible cryptic binding sites for bZip proteins used in the examples.

The sites 1 to 4 framed with continuous lines, which are all cryptic variants of the ATF/CREB sequence, were modified. The idealised variants were designated as id1 to id4.

AC: Cryptic ATF/CREB sequence.

id# The cryptic ATF/CREB sequence of each respective number was converted into an ideal ATF/CREB sequence.

FIG. 5 shows the sequence of the wild type MOX promoter with presumed TATA box and possible cryptic binding sites for bZip proteins used in the examples.

The site 1 framed with a continuous line (idAC1) and the site 1 framed with a short-dashed line (idY1) were modified.

AC: Cryptic ATF/CREB sequence.

idAC# The cryptic ATF/CREB sequence of each respective number was converted into an ideal ATF/CREB sequence.

Y: Cryptic YAP binding site.

idY# The cryptic YAP binding site of each respective number was converted into an ideal YAP binding site.

FIG. 6 shows the sequence of the wild type TPS1 promoter with presumed TATA box and possible cryptic binding sites for bZip proteins used in the examples.

The site 1 framed with a continuous line (idAC1) and the site 1 framed with a short-dashed line (idC1) were modified.

AC: Cryptic ATF/CREB sequence.

idAC# The cryptic ATF/CREB sequence of each respective number was converted into an ideal ATF/CREB sequence.

C: Cryptic C/EBP binding site.

idC# The cryptic C/EBP binding site of each respective number was converted into an ideal C/EBP binding site.

FIGS. 7A, 7B and 7C show gene maps of the lacZ reporter plasmids pCH-FMD, pC11-MOX and pCH-TPS1, which differ only in the respective promoter stated in the plasmid name.

FIG. 8 shows ideal binding sites for bZip proteins, the first column giving the designation, the middle column the idealised sequence and the right-hand column the biological origin of the sequence.

DETAILED DESCRIPTION OF THE INVENTION

The purpose of the present invention is therefore to provide a DNA comprising a promoter sequence which meets the individual requirements of methylotrophic yeast transcription systems.

According to the invention, this problem is solved by a DNA which comprises a promoter sequence which is derived from a wild type promoter of a methylotrophic yeast and whose transcription efficiency is modulated compared to the efficiency of the wild type promoter by introduction or modification of a DNA binding site.

It has been discovered that the transcription efficiency of the promoters of methylotrophic yeasts can be modulated. Such a modulation of the transcription efficiency can take place both by intensification and also by diminution of the activity of the promoter sequence and/or increased or decreased integration of the promoter sequence into the genome of a suitable host cell.

A wild type promoter from a methylotrophic yeast should be understood to mean a promoter which displays the same DNA sequence and the same transcription efficiency as the promoters from a naturally occurring methylotrophic yeast. Suitable wild type promoters are known to the skilled person from the specialist literature, through public deposition and through publication in databases. Thus for example the promoters FMD, MOX and TPS1 are disclosed in the European patent applications EP299108, EP173378 and EP1151112; the DHAS promoter is also described in EP173378. The promoters GAPDH and PMA1 are respectively in U.S. patent specification U.S. Pat. No. 935,789 and the document by Cox et al., Yeast 2000, September, 30; 16(13): 1191-203.

Examples of sequences of certain wild type FMD, MOX and TPS1 promoters can be taken from FIGS. 4 to 6. Since sequence analysis, in particular ten and more than twenty years ago sometimes contained errors, but possible errors are readily recognisable by checking with standardised routine methods, wild type promoters in the sense of the invention are understood to mean not only the wild type promoters disclosed by description in the state of the technology, but also the actually accessible wild type promoters.

Transcription efficiency should be understood to mean the production of transcript, i.e. mRNA per unit time. For the purpose of this application, it is determined as the quantity of a heterologous protein which is expressed in a suitable host cell per unit time under the control of a promoter, it being assumed that the translation efficiency depends only on the quantity of transcript available. The transcription efficiency is for example determined by standard procedures by quantitative determination of the heterologous protein or by evaluation of the signals of a reporter gene such as lacZ or phytase at different times.

Especially preferred is a DNA according to the invention, with which the transcription efficiency of the promoter sequence is increased compared to the efficiency of the wild type promoter by the insertion of a DNA binding site. In one embodiment, the DNA sequences according to the invention even display a considerably increased transcription efficiency compared to the wild type promoters. This is all the more surprising since the wild type promoters of methylotrophic yeasts already count among the most efficient promoters of all. A further increase, in particular so marked an increase, is therefore extremely surprising.

In a preferred embodiment, the invention concerns a DNA whose promoter sequence displays a transcription efficiency increased by at least the factor 1.5, preferably by at least the factor 2, and especially preferably by at least the factor 2.5, compared to the wild type promoter.

In another preferred embodiment, the invention concerns a DNA whose promoter sequence displays a decreased transcription efficiency compared to the wild type promoter, preferably a transcription efficiency decreased by at least the factor 0.5.

The promoter sequences are preferably a DNA according to the invention derived from a promoter from one of the genera Candida, Hansenula, Pichia or Torulopsis, especially preferably from a promoter from Hansenula polymorpha or Candida boidinii, and quite especially preferably from Hansenula polymorpha.

In a further preferred embodiment, the promoter sequence of a DNA according to the invention is derived from promoters of the group of the MOX, FMD, TPS1, DHAS, GAPDH and PMA1 promoters or similarly acting promoters from methylotrophic yeasts of the aforementioned genera.

Preferably a DNA according to the invention comprises at least one promoter sequence which is derived from a wild type promoter by the insertion, deletion and/or the exchange of at least one base, preferably by the exchange of one or two bases.

It has surprisingly been discovered that the wild type promoter sequences of methylotrophic yeasts have DNA regions which are similar but not identical to the DNA binding sites for transcription factors of the bZip protein class, these DNA regions naturally having no DNA binding function in the methylotrophic yeast. These so-called cryptic, i.e. similar but non-functional DNA binding sites for transcription factors of the bZip protein class, are a preferred option for the modification of the promoter transcription efficiency. It has been discovered that a modification of these cryptic DNA binding sites by deletion, insertion or exchange of individual or several base pairs can considerably modify the promoter transcription efficiency of methylotrophic yeasts. At the same time, in particular the formulation of this DNA binding site as a palindromic DNA shows a strong modification of the transcription activity.

In an especially preferred embodiment, the present invention therefore covers a DNA whose promoter sequence displays at least one DNA binding site, preferably a palindromic DNA binding site for transcription factors of the bZip protein class.

Transcription factors of the bZip protein class are for example described in Suckow et al., J. Mol. Biol. (1998) 276, 887-902. BZip proteins display a basic region which is responsible for the DNA recognition, and a C-terminal leucine “zip fastener” which is responsible for the specific dimer formation of these factors. Corresponding DNA binding sites for the bZip proteins are mostly palindromic or pseudopalindromic sequences with 10 or 9 base pairs.

Furthermore, it has been discovered that the transcription efficiency of promoter sequences of methylotrophic yeasts can be modulated by alteration of DNA regions whose motif is similar to the palindromic DNA binding sites of Zinkfinger proteins. These DNA regions can be modified by deletion, insertion or exchange of at least one base pair, as a result of which a palindromic DNA binding site for Zinkfinger proteins is preferably formed. In a further preferred embodiment, the promoter sequence of a DNA according to the invention therefore comprises at least one DNA binding site, preferably a palindromic DNA binding site, for Zinkfinger proteins.

Both the DNA motifs for transcription factors of the bZip protein class and also for Zinkfinger proteins are familiar to the skilled person in the field of molecular biology. Thus, in implementation of the present invention the skilled person can search the sequences of wild type promoters for DNA sequences which resemble the said motifs and by standard procedures modify these into functional and/or identical binding sites for bZip proteins or Zinkfinger proteins.

Especially preferred are DNA sequences according to the invention, whose promoter sequence displays at least one DNA binding site which essentially corresponds to the ATF/CREB binding site, the YAP binding site, the C/EBP binding site, the AP1 binding site or the G box. Examples of embodiments of these preferred binding sites are shown in FIG. 8 (example for AP1 or G box). Here “essentially” means that per strand not more than three, preferably two bases or only one base deviate(d) from the sequence at the said binding site.

In an especially preferred embodiment, the promoter sequence of a DNA according to the invention DNA comprises at least one palindromised DNA sequence, which is derived from the sequences given in SEQ ID No.:1-SEQ ID No.: 23. TABLE 1 Reference SEQ ID No: Sequence designation 1 CACTGGTCAC FMD AC1 2 ATGACTTTGG FMD AC2 3 CTGACAGGGT FMD AC3 4 GTGACGGAAT FMD AC4 5 ATGACAGCAT FMD AC5 6 TGGAGGTCAA FMD AC6 7 ATGAGCCAGT FMD AC7 8 TCCAGATCAT FMD AC8 9 GGAGGCTCAG FMD AC9 10 ATTTCGTAAT FMD Y1 11 ATGGTGTCAT MOX AC1 12 CTTTGGTCAT MOX AC2 13 CGAATGTAAT MOX Y1 14 GAAGCGTAAT MOX Y2 15 GCCCGGTAAT MOX Y3 16 ATGGTGAGAT TPSI AC1 17 ATGAAAGGCT TPSI AC2 18 ATGAATCTCC TPSI AC3 19 ATGACGATAT TPSI AC4 20 ATTGACCAAT TPSI C1 21 ATTGCACCCA TPSI C2 22 GATTCGTAAT TPSI Y1 23 TTTTCGTCAT TPSI AC5

Table 1 shows the preferred cryptic bZip sequences in the promoters FMD, MOX and TPS1 according to FIGS. 4 to 6.

Preferably, the palindromised sequence comprises at least 8 to 20, preferably 8 to 12, and especially preferably 10 bases.

The palindromised sequence can be symmetrical or unsymmetrical. Preferably it is symmetrical. The palindromised sequence preferably does not deviate from a completely palindromic sequence by more than 2 bases per strand, especially preferably by only one base and quite especially preferably not at all.

The DNA molecules according to the invention can for example by produced synthetically by common procedures, or else isolated from suitable DNA libraries and then mutated. The preparation of such libraries is likewise known to the skilled person. For example, an isolation is effected by preparing a probe with a length of at least 200-400 bp and the sequence of a wild type promoter of a methylotrophic yeast and a DNA library, in particular a genomic DNA library, is screened therewith. Such a probe can be prepared by means of a PCR (Polymerase Chain Reaction) with the use of suitable primers, which are each preferably at least 20-21 bp long and possess suitable sequences, and genomic or cDNA from methylotrophic yeasts as the “template”.

Probes can for example be synthesised or produced by fragmentation of available wild type promoter DNA. It is also possible to screen directly with probes corresponding to parts of the promoter sequence. However, such a procedure is less preferable on account of defective conservation of the sequence within non-coding sections.

Furthermore, the DNA molecules according to the invention can comprise at least one DNA sequence for a homologous and/or heterologous gene lying under the transcription control of the promoter sequence.

A “heterologous gene” should be understood to mean the coding region of a structural gene which either does not lie under the control of the endogenous (homologous) promoter or is not expressed in the organism from which it is derived, or is expressed neither under the control of the endogenous promoter nor in the original organism.

Furthermore, the DNA molecule according to the invention can comprise a DNA sequence coding for a signal peptide, which ensures the export of the expressed protein, the DNA sequence coding for the signal peptide preferably being directly bound to the heterologous gene to be expressed. For the secretion and modification of many eukaryotic proteins, it is necessary to fuse the protein sequence at the N-terminus with a signal sequence, in order to direct the polypeptides into the secretion apparatus. For example, components from the S. occidentalis gene GAM1 or from a hormonal gene of the crab Carcinus maenas (CHH), which were successfully utilised for the secretion of hirudin (Weydemann et al., 1995) can be considered for this. Preferably, a DNA according to the invention comprises a secretion signal selected from the group GAM, GAM-kex2, CHH-kex2, MFα-prepro and the chicken lysozyme secretion signal. The aforementioned secretion signals are known to the skilled person from the state of the technology, for example from the European patent specifications EP 394,538 B1 and EP 716,705 B1 and the European patent application EP 725,822 A1. Further, the DNA according to the invention can comprise a terminator element, which contains signal structures for the RNA polymerase which lead to the termination of the transcription. Examples of usable terminator elements are the MOX or PHO1 terminator from H. polymorpha.

Also an object of the invention is a host cell which contains at least one DNA according to the invention, the host cell being a prokaryotic or a eukaryotic cell.

For example, the eukaryotic cell can be a plant cell. Preferably the cell is a fungal cell, especially preferably a yeast cell. Possible host cells for the implementation of the present invention are for example filamentous fungi such as for example Aspergillus, Neurospora, Mucor, Trichoderma, Acremonium, Sordaria and Penicillium or yeasts such as Saccharomyces, Hansenula, Pichia, Torulopsis, Kluyveromyces, Schwanniomyces, Yarro-wia, Arxula, Trichosporon and Candida.

Quite especially preferred is a methylotrophic yeast cell, especially one of the genus Candida, Torulopsis, Hansenula or Pichia. Most preferred is Hansenula polymorpha.

The present invention also concerns expression vectors, preferably plasmids, in which the vector comprises a DNA according to the invention.

In a preferred embodiment, such expression vectors include the phytase gene or LacZ as a reporter reading frame.

The present invention in addition provides a kit comprising:

-   (a) an expression vector according to the invention, which is     suitable for the incorporation of a DNA which codes for a     recombinant protein, and -   (b) a host cell suitable for the production of the recombinant     protein.

The incorporation of the DNA coding for a recombinant protein can be effected by all cloning methods known in the state of the technology. These are however not all described in detail, since they are part of the skilled person's normal stock in trade.

Further, a kit is provided, comprising

-   (a) an expression vector according to the invention, which is     suitable for the incorporation of a DNA which codes for a     recombinant protein, and -   (b) a host cell which is suitable for the production of the     recombinant protein under the transcription control of the promoter     sequence.

The DNA molecules, host cells, expression vectors and kits according to the invention can be used for the recombinant expression of a gene under the control of the promoter sequence for the production of one or several proteins.

The present invention thus also concerns the use of the objects according to the invention for the production of proteins.

“Recombinant expression in a suitable host cell” should be understood to mean all expression methods in known expression systems known in the state of the technology, which could be used here. These are however not all described in detail, since they are part of the skilled person's normal stock in trade.

Also an object of the invention is a process for the production of one or several proteins, comprising:

-   (i) the cloning of at least one DNA which codes for a recombinant     protein, into an expression vector according to the invention, so     that the cloned DNA lies under the transcription control of the     promoter sequence; -   (ii) the introduction of the expression vector obtained in (i) into     a host cell which is suitable for the production of the recombinant     protein, optionally after induction of the promoter sequence; -   (iii) the cultivation of the host cell obtained in (ii); -   (iv) and optionally the induction of the promoter sequence in a     manner in itself known.

In a preferred embodiment, the process according to the invention includes the following steps:

-   (i) the introduction of an expression vector according to the     invention into a host cell which is suitable for the production of     the recombinant protein, optionally after induction of the promoter     sequence; -   (ii) the cultivation of the host cell obtained in (i); -   (iii) and optionally the induction of the promoter sequence in a     manner in itself known.

A further aspect of the present invention concerns processes for the production of the expression vectors according to the invention, comprising

-   (a) the insertion of a DNA according to the invention into an     expression vector; or -   (b) the modification of a promoter sequence of a methylotrophic     yeast in an expression vector by insertion, preferably deletion     and/or exchange of at least one base pair in a manner it itself     known, so that a palindromic binding site is produced, preferably a     palindromic binding site for transcription factors of the bZip     proteins or for Zinkfinger proteins; and -   (c) optionally the introduction of a heterologous gene in a manner     in itself known into an expression vector, so that the heterologous     gene lies under the transcription control of the DNA from (a) or of     the modified promoter sequence from (b).

EXPERIMENTAL EXAMPLES

Introduction

The following examples show the change in the transcription efficiency of promoters of methylotrophic yeasts such as the FMD, MOXund TPS1 promoters due to conversion of individual or several cryptic DNA binding sites into their palindromic and thus formally idealised derivatives.

Preparation of Modified Wild Type Promoters

The wild type promoters to be modified (see FIGS. 4 (FMD), 5 (MOX), and 6 (TPS1)) were transferred into pUC18 and there deliberately modified in a manner in itself known by means of PCR mutagenesis. After verification of the exchanges by sequence analysis, the modified promoters were subcloned into expression plasmids which contained either phytase or lacZ as reporter reading frames. The two series of reporter plasmids created in this way differ only in their promoter sequences (see for example for lacZ as reporter gene the FIG. 7A (FMD promoter), FIG. 7B (MOX promoter) and FIG. 7C (TPS1 promoter)).

Integrative Test System

In order to be able to compare promoters with one another objectively, the reporter gene doses of the strains to be compared must be comparable. The following integrative test system was therefore constructed, which is suitable both for lacZ and also for phytase reporter plasmids. H. polymorpha was transformed with each reporter plasmid and 48 strains with genomically integrated plasmid DNA were created per construct. These sets of 48 strains were then subjected to a reporter protein test. This test yielded two pieces of information: firstly the construct-determined reporter gene expression strengths under the tested condition as an endpoint determination, and secondly the identification of the two strains in each set with the highest reporter gene expression rates. These strains were then subjected to further comparative experiments. Furthermore, the copy number of the integrated reporter plasmid in them was determined, in order to exclude the possibility that of any intensified promoter activities being attributable to a gene dosage effect.

Determination of FMD Promoter Variants with Increased Activity and/or Modified Regulation

The strains with id1 or id1-4 in the strain designation contained expression cassettes with FMD promoter variants with deliberate modifications in the region of cryptic bZip binding sites (here: in the region of the ATF/CREB sequences 1 to 4). The various promoters were tested both in combination with lacZ and also in combination with the phytase gene.

For the quantification of the β-galactosidase, ONPG measurements (ONPG: ortho-nitrophenyl-β-D-galactopyranoside; J. H. Miller (1972) Experiments in Molecular Genetics. Cold Spring Harbour Laboratory Press. Cold Spring Harbour, N.Y., USA) were carried out with intracellular soluble fractions.

For phytase, the phytase test was carried out with culture supernatants (A. F. Mayer, K. Hellmuth, H. Schlieker, R. Lopez-Ulibarry, S. Oertel, U. Dahlems, A. W. M. Strasser and A. P. G. M. van Loon (1999) An expression System matures: a highly efficient and cost-effective process for phytase production by recombinant strains of Hansenula polymorpha. Biotechnology and Bioengineering 63, 373-381; L. Pasamontes, M. Haiker, M. Wyss, M. Tessier and A. P. G. M. van Loon (1997) Gene cloning, purification and characterization of a heat-stable phytase from the fungus Aspergillus fumigatus. Appl. Environ. Microbiol. 63, 1696-1700).

Independently of the reporter gene, two different promoter characteristics were analysed: strength and regulation. The determination of the relative copy numbers of the integrated plasmids showed that the compared strains with modified promoters contained fewer or the same number of copies as the respective primary strain with the wild type promoter.

Strength of the Modified FMD Promoters

For the assessment of each FMD promoter modified according to the invention compared to the strength of the wild type FMD promoter, the following experiment was performed. The strongest strain for each reporter plasmid was cultivated for 48 hours on glucose or on glycerine as the carbon source. After 4, 24 and 48 hours, cells (lacZ) or culture supernatant (phytase gene) were harvested. For lacZ, starting from the harvested cells, intracellular soluble fractions were prepared, with which an ONPG measurement was then performed for the determination of the specific β-galactosidase activity. For phytase, a phosphatase test, in which the harvested culture supernatants were used directly, was performed. FIGS. 1 A and B show the absolute measured values for the strains tested after 24 hours' growth on glucose (FIG. 1A) and glycerine (FIG. 1B). Both with lacZ and also with phytase gene, peak values which lie markedly above the values attained with the wild type FMD promoter are attained with the modified promoters. This effect was in general more pronounced with lacZ (FIG. 1B) than with phytase gene (FIG. 1A). This occurs because phytase is a secretory glycoprotein, for which a large number of transport and modification steps must be completed between the transcription of the gene and the appearance of the active gene product in the culture supernatant, which are not necessary for the generation of the cytosol-located β-galactosidase. Both with phytase gene and also with lacZ, individual FMD promoter variants reach higher activities on glucose than the wild type promoter on glycerine. In the case of lacZ, the peak values attained with the wild type promoter were exceeded by a factor of up to 3.5, and in the case of phytase gene by a factor of up to 2.5 (see FIGS. 1A and 1B).

Regulation of Modified FMD Promoters

The FMD promoter variants tested showed regulation deviating to some extent from that of the wild type promoter. As a measure for the regulation, the quotient of the values measured after 24 hours' cultivation on glycerine and glucose was generated. A factor of 1 signifies constitutivity, based on the carbon sources studied. Higher factors indicate that the given promoter displays higher activity on glycerine than on glucose. The order of the regulation factors measured with the various FMD promoter variants is qualitatively identical with both reporter genes (lacZ and phytase gene) (FIG. 2): id1-4>wt>id1.

The promoter id1 is approximately constitutive, while the promoter id1-4 displays a higher regulation than the wild type FMD promoter with both reporter genes (see FIG. 2).

Thus it has been shown that according to the invention both the strength and also the regulation of methylotrophic promoters, in this case the FMD promoter, can be significantly modified.

MOX and TPS1 Promoter Variants with Increased Activity on Media with Glucose as Carbon Source

To demonstrate the transferability of the modulability of efficiency demonstrated above for FMD promoters to other promoters of methylotrophic yeasts, the transcription efficiency of a MOX promoter and a TPS1 promoter was also modulated according to the invention. In the sequence of the MOX promoter of H. polymorpha, five cryptic binding sites for bZip proteins were identified, namely two motifs with similarity to the ATF/CREB sequence and three motifs with similarity to the YAP binding site (see FIG. 5). By analogy to the modifications carried out on the FMD promoter, two of these sequence motifs were palindromised. The resulting promoter variants MOX-idA/CI and MOX-idY1 and the wild type MOX promoter were transferred into the lacZ reporter plasmid, as already described for the FMD promoter, in such a manner that they took over control of the reporter gene.

In the sequence of the TPS1 promoter, a total of eight possible cryptic bZip motifs were identified (see FIG. 6). Five of the motifs described in this example have similarity to the ATF/CREB sequence, two to the C/EBP binding site and one to the YAP binding site. For this promoter, two variants were constructed, each with a palindromised sequence motif, TPS-idA/CI and TPS-idCI.

Further cryptic bZip or Zinkfinger motifs are accessible to the skilled person through more detailed analysis, e.g. by recourse to computer-supported comparison algorithms.

48 strains were generated per construct. As described for the FMD constructs, the strains with the highest reporter gene activities were firstly determined by means of an X-Gal overlay test, and these were then analysed in more detail. FIG. 3 shows an X-Gal overlay test of five of these strains from each set, which had previously been grown at 37° C. on YNB/glucose plates. Both with the MOX and also with the TSP1 promoter, it was found that the palindromisation of the first (nearest to the TATA box) ATF/CREB sequence-like motif under the tested condition resulted in a visible increase in the promoter activity, compared to the respective wild type promoter (see FIG. 3). With other carbon sources tested, this effect could not be observed. This result shows that the transcription properties of MOX and TPS1 promoters can also be modified by modification, in particular palindromisation, of already present cryptic DNA binding sites. 

1. A DNA comprising at least one promoter sequence which is derived from a wild type promoter of a methylotrophic yeast, characterised in that the transcription efficiency of this promoter sequence compared to the efficiency of the wild type promoter is increased by introduction or modification of a DNA binding site.
 2. The DNA according to claim 1, characterised in that the transcription efficiency of the promoter sequence compared to the efficiency of the wild type promoter is increased by at least the factor 1.5, preferably by at least the factor 2, especially preferably by at least the factor 2.5.
 3. The DNA according to claim 1, characterised in that the promoter sequence is derived from a promoter of one of the genera Candida, Hansenula, Pichia or Torulopsis.
 4. The DNA according to claim 3, characterised in that the promoter sequence is derived from a promoter from Hansenula polymorpha.
 5. The DNA according to claim 1, characterised in that the promoter sequence is derived from a promoter which is selected from the group consisting of the promoters MOX, FMD, TPSI, DHAS, AOX, GAPDH and PMAI.
 6. The DNA according to claim 1, characterised in that the promoter sequence is derived from a wild type promoter by the
 7. The DNA according to claim 1, characterised in that said promoter sequence has at least one DNA binding site, preferably a palindromic DNA binding site, for transcription factors of the bZip protein class.
 8. The DNA according to claim 1, characterised in that said promoter sequence has at least one DNA binding site, preferably a palindromic DNA binding site, for zinc finger proteins.
 9. The DNA according to claim 1, characterised in that said promoter sequence has at least one DNA binding site which essentially corresponds to the ATF/CREB binding site, the YAP binding site, the C/EBP binding site, the AP1 binding site or the G box.
 10. The DNA according to one of claims 7 to 9, characterised in that its promoter sequence contains at least one palindromised DNA sequence which is derived from a sequence stated in SEQ ID No.: 1-SEQ ID No.: 23 and differs from this in not more than 2 bases.
 11. The DNA according to claim 10, characterised in that the palindromised sequence comprises at least 8 to 20, preferably 8 to 12, and more preferably 10 bases.
 12. The DNA according to claim 11, characterised in that the palindromised sequence is symmetrical or unsymmetrical, preferably symmetrical, and preferably deviates from a completely palindromic sequence in not more than two bases per strand, especially preferably in only one base and quite especially preferably not at all.
 13. The DNA according to claim 1, characterised in that it further comprises at least one DNA sequence for a homologous and/or heterologous gene lying under the transcription control of the promoter sequence.
 14. The DNA according to claim 1, characterised in that it further comprises a DNA sequence coding for a secretion signal.
 15. The DNA according to claim 1, characterised in that the secretion signal is selected from the group GAM, GAM-kex2. CHH-kex2, MFα-prepro and the chicken lysozyme secretion signal.
 16. A host cell, characterised in that the host cell contains at least one DNA according to claim
 1. 17. The host cell according to claim 16, characterised in that said DNA is integrated into the genome of the host cell, or is present in the host cell as an extrachromosomal DNA molecule, preferably episomally.
 18. The host cell according to claim 16, characterised in that the host cell is a yeast cell, preferably a methylotrophic yeast cell, especially preferably a yeast cell of the genera Candida, Torulopsis, Hansenula or Pichia, especially preferably Hansenula polymorpha.
 19. An expression vector, preferably a plasmid, characterised in that said vector comprises a DNA according to claim
 1. 20. The expression vector according to claim 19, characterised in that the vector contains the phytase gene or lacZ as a reporter reading frame.
 21. A kit, comprising: (a) the expression vector according to claim 19 or claim 20, which is suitable for the cloning of a DNA which codes for a recombinant protein, and (b) a host cell suitable for the production of the recombinant protein.
 22. Use of the DNA according to claim 1 for the expression of a gene.
 23. Use of the DNA according to claim 1 for the production of at least one protein.
 24. A method for the production of one of at least one protein, comprising: (i) introduction of an expression vector comprising the DNA according to claim 1 into a host cell which is suitable for the production of the recombinant protein, optionally after induction of the promoter sequence; (ii) cultivation of the host cell obtained in (i); and (iii) and optionally induction of the promoter sequence in a manner in itself known, as a result of which the protein or proteins is/are produced by the host cell.
 25. A method for the production of at least one protein, comprising: (i) construction of an expression vector containing a DNA according to claim 12; (ii) introduction of an expression vector into a host cell which is suitable for the production of a recombinant protein, optionally after induction of the promoter sequence; (iii) cultivation of the host cell obtained in (ii); and (iv) and optionally induction of the promoter sequence in a manner in itself known, as a result of which the protein or proteins is/are produced by the host cell.
 26. A method for the production of expression vectors according to one of claims 19 or 20, comprising: (i) the modification of a promoter sequence of a methylotrophic yeast in an expression vector by insertion, deletion and/or exchange of, at least one base pair in a manner in itself known, so that a palindromic binding site is produced, preferably a palindromic binding site for transcription factors of the bZip proteins or for zinc finger proteins; and (ii) optionally the introduction of a heterologous gene in a manner in itself known into an expression vector, so that the heterologous gene lies under the transcription control of the DNA from (i).
 27. Use of the host cell according to claim 16 for the expression of a gene.
 28. Use of the host cell according to claim 16 for the production of at least one protein.
 29. Use of the kit according to claim 21 for the expression of a gene.
 30. Use of the kit according to claim 21 for the production of at least one protein. 